Overview
Brought to you by YData
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 95798 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 10.2 MiB |
| Average record size in memory | 112.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 3 |
Area_in_hectares is highly overall correlated with Production_in_tons | High correlation |
Crop is highly overall correlated with Crop_Type and 4 other fields | High correlation |
Crop_Type is highly overall correlated with Crop and 1 other fields | High correlation |
K is highly overall correlated with Crop and 2 other fields | High correlation |
N is highly overall correlated with Crop and 2 other fields | High correlation |
P is highly overall correlated with Crop | High correlation |
Production_in_tons is highly overall correlated with Area_in_hectares | High correlation |
State_Name is highly overall correlated with rainfall and 1 other fields | High correlation |
Yield_ton_per_hec is highly overall correlated with K and 1 other fields | High correlation |
pH is highly overall correlated with Crop | High correlation |
rainfall is highly overall correlated with Crop_Type and 1 other fields | High correlation |
temperature is highly overall correlated with State_Name | High correlation |
Unnamed: 0 has unique values | Unique |
Reproduction
| Analysis started | 2025-11-21 05:46:27.145758 |
|---|---|
| Analysis finished | 2025-11-21 05:46:40.536021 |
| Duration | 13.39 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
Unnamed: 0
Real number (ℝ)
Unique
| Distinct | 95798 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49883.041 |
| Minimum | 0 |
|---|---|
| Maximum | 99848 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4882.85 |
| Q1 | 24470.25 |
| median | 50360 |
| Q3 | 74913.75 |
| 95-th percentile | 94858.15 |
| Maximum | 99848 |
| Range | 99848 |
| Interquartile range (IQR) | 50443.5 |
Descriptive statistics
| Standard deviation | 28952.226 |
|---|---|
| Coefficient of variation (CV) | 0.58040218 |
| Kurtosis | -1.2145184 |
| Mean | 49883.041 |
| Median Absolute Deviation (MAD) | 25214.5 |
| Skewness | -0.0063238083 |
| Sum | 4.7786956 × 109 |
| Variance | 8.3823139 × 108 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 99848 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 99832 | 1 | < 0.1% |
| 99831 | 1 | < 0.1% |
| 99829 | 1 | < 0.1% |
| 99828 | 1 | < 0.1% |
| Other values (95788) | 95788 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 99848 | 1 | |
| 99847 | 1 | |
| 99846 | 1 | |
| 99845 | 1 | |
| 99844 | 1 | |
| 99843 | 1 | |
| 99842 | 1 | |
| 99841 | 1 | |
| 99840 | 1 | |
| 99839 | 1 |
State_Name
Categorical
High correlation
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| uttar pradesh | |
|---|---|
| karnataka | |
| madhya pradesh | |
| bihar | |
| odisha | |
| Other values (28) |
Length
| Max length | 27 |
|---|---|
| Median length | 16 |
| Mean length | 9.7809662 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | andhra pradesh |
|---|---|
| 2nd row | andhra pradesh |
| 3rd row | andhra pradesh |
| 4th row | andhra pradesh |
| 5th row | andhra pradesh |
Common Values
| Value | Count | Frequency (%) |
| uttar pradesh | 12536 | |
| karnataka | 8910 | 9.3% |
| madhya pradesh | 8702 | 9.1% |
| bihar | 8437 | 8.8% |
| odisha | 6236 | 6.5% |
| tamil nadu | 5566 | 5.8% |
| assam | 5495 | 5.7% |
| rajasthan | 5358 | 5.6% |
| maharashtra | 4162 | 4.3% |
| west bengal | 3563 | 3.7% |
| Other values (23) | 26833 |
Length
| Value | Count | Frequency (%) |
| pradesh | 26974 | |
| uttar | 12536 | 9.4% |
| karnataka | 8910 | 6.6% |
| madhya | 8702 | 6.5% |
| bihar | 8437 | 6.3% |
| odisha | 6236 | 4.7% |
| tamil | 5566 | 4.2% |
| nadu | 5566 | 4.2% |
| assam | 5495 | 4.1% |
| rajasthan | 5358 | 4.0% |
| Other values (32) | 40286 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 226498 | |
| r | 90749 | |
| h | 88792 | 9.5% |
| t | 67866 | 7.2% |
| s | 61805 | 6.6% |
| d | 56589 | 6.0% |
| n | 42385 | 4.5% |
| e | 40166 | 4.3% |
| 38268 | 4.1% | |
| m | 30913 | 3.3% |
| Other values (14) | 192966 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 936997 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 226498 | |
| r | 90749 | |
| h | 88792 | 9.5% |
| t | 67866 | 7.2% |
| s | 61805 | 6.6% |
| d | 56589 | 6.0% |
| n | 42385 | 4.5% |
| e | 40166 | 4.3% |
| 38268 | 4.1% | |
| m | 30913 | 3.3% |
| Other values (14) | 192966 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 936997 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 226498 | |
| r | 90749 | |
| h | 88792 | 9.5% |
| t | 67866 | 7.2% |
| s | 61805 | 6.6% |
| d | 56589 | 6.0% |
| n | 42385 | 4.5% |
| e | 40166 | 4.3% |
| 38268 | 4.1% | |
| m | 30913 | 3.3% |
| Other values (14) | 192966 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 936997 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 226498 | |
| r | 90749 | |
| h | 88792 | 9.5% |
| t | 67866 | 7.2% |
| s | 61805 | 6.6% |
| d | 56589 | 6.0% |
| n | 42385 | 4.5% |
| e | 40166 | 4.3% |
| 38268 | 4.1% | |
| m | 30913 | 3.3% |
| Other values (14) | 192966 |
Crop_Type
Categorical
High correlation
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| kharif | |
|---|---|
| rabi | |
| whole year | |
| summer |
Length
| Max length | 10 |
|---|---|
| Median length | 6 |
| Mean length | 6.452285 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | kharif |
|---|---|
| 2nd row | kharif |
| 3rd row | kharif |
| 4th row | kharif |
| 5th row | kharif |
Common Values
| Value | Count | Frequency (%) |
| kharif | 37785 | |
| rabi | 26878 | |
| whole year | 24271 | |
| summer | 6864 | 7.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| kharif | 37785 | |
| rabi | 26878 | |
| whole | 24271 | |
| year | 24271 | |
| summer | 6864 | 5.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 95798 | |
| a | 88934 | |
| i | 64663 | |
| h | 62056 | |
| e | 55406 | |
| k | 37785 | 6.1% |
| f | 37785 | 6.1% |
| b | 26878 | 4.3% |
| w | 24271 | 3.9% |
| o | 24271 | 3.9% |
| Other values (6) | 100269 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 618116 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 95798 | |
| a | 88934 | |
| i | 64663 | |
| h | 62056 | |
| e | 55406 | |
| k | 37785 | 6.1% |
| f | 37785 | 6.1% |
| b | 26878 | 4.3% |
| w | 24271 | 3.9% |
| o | 24271 | 3.9% |
| Other values (6) | 100269 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 618116 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 95798 | |
| a | 88934 | |
| i | 64663 | |
| h | 62056 | |
| e | 55406 | |
| k | 37785 | 6.1% |
| f | 37785 | 6.1% |
| b | 26878 | 4.3% |
| w | 24271 | 3.9% |
| o | 24271 | 3.9% |
| Other values (6) | 100269 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 618116 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 95798 | |
| a | 88934 | |
| i | 64663 | |
| h | 62056 | |
| e | 55406 | |
| k | 37785 | 6.1% |
| f | 37785 | 6.1% |
| b | 26878 | 4.3% |
| w | 24271 | 3.9% |
| o | 24271 | 3.9% |
| Other values (6) | 100269 |
Crop
Categorical
High correlation
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| rice | |
|---|---|
| maize | |
| moong | |
| wheat | |
| sesamum | |
| Other values (30) |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.1367565 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | cotton |
|---|---|
| 2nd row | horsegram |
| 3rd row | jowar |
| 4th row | maize |
| 5th row | moong |
Common Values
| Value | Count | Frequency (%) |
| rice | 11295 | 11.8% |
| maize | 9368 | 9.8% |
| moong | 6704 | 7.0% |
| wheat | 6177 | 6.4% |
| sesamum | 6081 | 6.3% |
| rapeseed | 5342 | 5.6% |
| potato | 5297 | 5.5% |
| jowar | 5118 | 5.3% |
| onion | 4930 | 5.1% |
| sunflower | 3631 | 3.8% |
| Other values (25) | 31855 |
Length
| Value | Count | Frequency (%) |
| rice | 11295 | 11.8% |
| maize | 9368 | 9.8% |
| moong | 6704 | 7.0% |
| wheat | 6177 | 6.4% |
| sesamum | 6081 | 6.3% |
| rapeseed | 5342 | 5.6% |
| potato | 5297 | 5.5% |
| jowar | 5118 | 5.3% |
| onion | 4930 | 5.1% |
| sunflower | 3631 | 3.8% |
| Other values (25) | 31855 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 75433 | |
| a | 72493 | |
| o | 63516 | |
| r | 50402 | 8.6% |
| t | 38208 | 6.5% |
| i | 36791 | 6.3% |
| n | 34753 | 5.9% |
| m | 34276 | 5.8% |
| s | 30455 | 5.2% |
| c | 24666 | 4.2% |
| Other values (13) | 126896 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 587889 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 75433 | |
| a | 72493 | |
| o | 63516 | |
| r | 50402 | 8.6% |
| t | 38208 | 6.5% |
| i | 36791 | 6.3% |
| n | 34753 | 5.9% |
| m | 34276 | 5.8% |
| s | 30455 | 5.2% |
| c | 24666 | 4.2% |
| Other values (13) | 126896 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 587889 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 75433 | |
| a | 72493 | |
| o | 63516 | |
| r | 50402 | 8.6% |
| t | 38208 | 6.5% |
| i | 36791 | 6.3% |
| n | 34753 | 5.9% |
| m | 34276 | 5.8% |
| s | 30455 | 5.2% |
| c | 24666 | 4.2% |
| Other values (13) | 126896 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 587889 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 75433 | |
| a | 72493 | |
| o | 63516 | |
| r | 50402 | 8.6% |
| t | 38208 | 6.5% |
| i | 36791 | 6.3% |
| n | 34753 | 5.9% |
| m | 34276 | 5.8% |
| s | 30455 | 5.2% |
| c | 24666 | 4.2% |
| Other values (13) | 126896 |
N
Real number (ℝ)
High correlation
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69.848744 |
| Minimum | 10 |
|---|---|
| Maximum | 180 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 50 |
| median | 70 |
| Q3 | 80 |
| 95-th percentile | 180 |
| Maximum | 180 |
| Range | 170 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 39.815456 |
|---|---|
| Coefficient of variation (CV) | 0.57002393 |
| Kurtosis | 1.0337132 |
| Mean | 69.848744 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.93193874 |
| Sum | 6691370 |
| Variance | 1585.2705 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 27074 | |
| 50 | 15360 | |
| 20 | 12098 | |
| 120 | 7937 | 8.3% |
| 60 | 6177 | 6.4% |
| 30 | 6081 | 6.3% |
| 180 | 5297 | 5.5% |
| 100 | 4157 | 4.3% |
| 70 | 3786 | 4.0% |
| 90 | 2878 | 3.0% |
| Other values (4) | 4953 | 5.2% |
| Value | Count | Frequency (%) |
| 10 | 2108 | 2.2% |
| 20 | 12098 | |
| 25 | 2535 | 2.6% |
| 30 | 6081 | 6.3% |
| 50 | 15360 | |
| 60 | 6177 | 6.4% |
| 70 | 3786 | 4.0% |
| 75 | 203 | 0.2% |
| 80 | 27074 | |
| 90 | 2878 | 3.0% |
| Value | Count | Frequency (%) |
| 180 | 5297 | 5.5% |
| 160 | 107 | 0.1% |
| 120 | 7937 | 8.3% |
| 100 | 4157 | 4.3% |
| 90 | 2878 | 3.0% |
| 80 | 27074 | |
| 75 | 203 | 0.2% |
| 70 | 3786 | 4.0% |
| 60 | 6177 | 6.4% |
| 50 | 15360 |
P
Real number (ℝ)
High correlation
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.647947 |
| Minimum | 10 |
|---|---|
| Maximum | 125 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 40 |
| median | 40 |
| Q3 | 60 |
| 95-th percentile | 60 |
| Maximum | 125 |
| Range | 115 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 14.849365 |
|---|---|
| Coefficient of variation (CV) | 0.35654496 |
| Kurtosis | 0.60088237 |
| Mean | 41.647947 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.094755041 |
| Sum | 3989790 |
| Variance | 220.50366 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40 | 49916 | |
| 60 | 21898 | |
| 15 | 6249 | 6.5% |
| 30 | 6177 | 6.4% |
| 20 | 5224 | 5.5% |
| 75 | 2551 | 2.7% |
| 10 | 2302 | 2.4% |
| 50 | 1355 | 1.4% |
| 125 | 91 | 0.1% |
| 65 | 35 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 2302 | 2.4% |
| 15 | 6249 | 6.5% |
| 20 | 5224 | 5.5% |
| 30 | 6177 | 6.4% |
| 40 | 49916 | |
| 50 | 1355 | 1.4% |
| 60 | 21898 | |
| 65 | 35 | < 0.1% |
| 75 | 2551 | 2.7% |
| 125 | 91 | 0.1% |
| Value | Count | Frequency (%) |
| 125 | 91 | 0.1% |
| 75 | 2551 | 2.7% |
| 65 | 35 | < 0.1% |
| 60 | 21898 | |
| 50 | 1355 | 1.4% |
| 40 | 49916 | |
| 30 | 6177 | 6.4% |
| 20 | 5224 | 5.5% |
| 15 | 6249 | 6.5% |
| 10 | 2302 | 2.4% |
K
Real number (ℝ)
High correlation
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.967003 |
| Minimum | 10 |
|---|---|
| Maximum | 200 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 20 |
| median | 30 |
| Q3 | 50 |
| 95-th percentile | 100 |
| Maximum | 200 |
| Range | 190 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 28.312965 |
|---|---|
| Coefficient of variation (CV) | 0.67464824 |
| Kurtosis | 2.8072415 |
| Mean | 41.967003 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.7391263 |
| Sum | 4020355 |
| Variance | 801.62398 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 34439 | |
| 40 | 17706 | |
| 30 | 16127 | |
| 90 | 5491 | 5.7% |
| 65 | 4930 | 5.1% |
| 50 | 4263 | 4.4% |
| 45 | 3083 | 3.2% |
| 120 | 2985 | 3.1% |
| 60 | 2770 | 2.9% |
| 100 | 2535 | 2.6% |
| Other values (5) | 1469 | 1.5% |
| Value | Count | Frequency (%) |
| 10 | 120 | 0.1% |
| 20 | 34439 | |
| 30 | 16127 | |
| 40 | 17706 | |
| 45 | 3083 | 3.2% |
| 50 | 4263 | 4.4% |
| 60 | 2770 | 2.9% |
| 65 | 4930 | 5.1% |
| 70 | 35 | < 0.1% |
| 90 | 5491 | 5.7% |
| Value | Count | Frequency (%) |
| 200 | 91 | 0.1% |
| 150 | 203 | 0.2% |
| 140 | 1020 | 1.1% |
| 120 | 2985 | |
| 100 | 2535 | |
| 90 | 5491 | |
| 70 | 35 | < 0.1% |
| 65 | 4930 | |
| 60 | 2770 | |
| 50 | 4263 |
pH
Real number (ℝ)
High correlation
| Distinct | 101 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.6367209 |
| Minimum | 3.82 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 3.82 |
|---|---|
| 5-th percentile | 4.92 |
| Q1 | 5.36 |
| median | 5.54 |
| Q3 | 5.92 |
| 95-th percentile | 6.6 |
| Maximum | 7 |
| Range | 3.18 |
| Interquartile range (IQR) | 0.56 |
Descriptive statistics
| Standard deviation | 0.50210457 |
|---|---|
| Coefficient of variation (CV) | 0.089077422 |
| Kurtosis | -0.02829505 |
| Mean | 5.6367209 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 0.60625494 |
| Sum | 539986.59 |
| Variance | 0.252109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.36 | 2826 | 2.9% |
| 5.42 | 2781 | 2.9% |
| 5.4 | 2769 | 2.9% |
| 5.38 | 2760 | 2.9% |
| 5.32 | 2749 | 2.9% |
| 5.6 | 2748 | 2.9% |
| 5.62 | 2715 | 2.8% |
| 5.68 | 2709 | 2.8% |
| 5.54 | 2708 | 2.8% |
| 5.5 | 2708 | 2.8% |
| Other values (91) | 68325 |
| Value | Count | Frequency (%) |
| 3.82 | 11 | |
| 3.84 | 6 | |
| 3.86 | 11 | |
| 3.88 | 10 | |
| 3.9 | 12 | |
| 3.92 | 14 | |
| 3.94 | 13 | |
| 3.96 | 10 | |
| 3.98 | 11 | |
| 4 | 7 |
| Value | Count | Frequency (%) |
| 7 | 548 | |
| 6.9 | 548 | |
| 6.8 | 578 | |
| 6.7 | 540 | |
| 6.68 | 576 | |
| 6.66 | 588 | |
| 6.64 | 604 | |
| 6.62 | 551 | |
| 6.6 | 1124 | |
| 6.58 | 587 |
rainfall
Real number (ℝ)
High correlation
| Distinct | 111 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 697.17581 |
| Minimum | 3.274569 |
|---|---|
| Maximum | 3322.06 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 3.274569 |
|---|---|
| 5-th percentile | 41.3 |
| Q1 | 157.31 |
| median | 579.75 |
| Q3 | 1110.78 |
| 95-th percentile | 1712.66 |
| Maximum | 3322.06 |
| Range | 3318.7854 |
| Interquartile range (IQR) | 953.47 |
Descriptive statistics
| Standard deviation | 604.18354 |
|---|---|
| Coefficient of variation (CV) | 0.86661575 |
| Kurtosis | 1.475034 |
| Mean | 697.17581 |
| Median Absolute Deviation (MAD) | 446.89 |
| Skewness | 1.1354534 |
| Sum | 66788049 |
| Variance | 365037.75 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 579.75 | 4871 | 5.1% |
| 75.32 | 4794 | 5.0% |
| 1011.49 | 3485 | 3.6% |
| 884.5 | 3451 | 3.6% |
| 1111.68 | 3423 | 3.6% |
| 1246.715 | 3106 | 3.2% |
| 840.46 | 2717 | 2.8% |
| 87.2 | 2584 | 2.7% |
| 510.05 | 2550 | 2.7% |
| 607.48 | 2503 | 2.6% |
| Other values (101) | 62314 |
| Value | Count | Frequency (%) |
| 3.274569 | 45 | < 0.1% |
| 3.94 | 106 | 0.1% |
| 5.274 | 31 | < 0.1% |
| 9.627044 | 16 | < 0.1% |
| 10.265748 | 70 | 0.1% |
| 15.34 | 594 | 0.6% |
| 19.38 | 1360 | |
| 34.81 | 1677 | |
| 35.214 | 23 | < 0.1% |
| 37.09 | 234 | 0.2% |
| Value | Count | Frequency (%) |
| 3322.06 | 73 | 0.1% |
| 3041.4 | 18 | < 0.1% |
| 2879.86 | 29 | < 0.1% |
| 2817.86 | 1386 | |
| 2569.52 | 272 | 0.3% |
| 2459.64 | 8 | < 0.1% |
| 2169.32 | 2399 | |
| 1997.12 | 339 | 0.4% |
| 1925.68 | 20 | < 0.1% |
| 1875.6 | 136 | 0.1% |
temperature
Real number (ℝ)
High correlation
| Distinct | 109 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.639702 |
| Minimum | 1.18 |
|---|---|
| Maximum | 35.346667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1.18 |
|---|---|
| 5-th percentile | 20.1 |
| Q1 | 22.890909 |
| median | 27.276 |
| Q3 | 29.266667 |
| 95-th percentile | 34.01 |
| Maximum | 35.346667 |
| Range | 34.166667 |
| Interquartile range (IQR) | 6.3757576 |
Descriptive statistics
| Standard deviation | 4.8947147 |
|---|---|
| Coefficient of variation (CV) | 0.18373759 |
| Kurtosis | 2.2810095 |
| Mean | 26.639702 |
| Median Absolute Deviation (MAD) | 3.3406667 |
| Skewness | -0.76221439 |
| Sum | 2552030.2 |
| Variance | 23.958232 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34.01 | 4871 | 5.1% |
| 22.676 | 4794 | 5.0% |
| 30.43 | 3485 | 3.6% |
| 27.65454545 | 3451 | 3.6% |
| 28.64818182 | 3423 | 3.6% |
| 22.6 | 3106 | 3.2% |
| 33.58333333 | 2717 | 2.8% |
| 23.106 | 2584 | 2.7% |
| 33.37333333 | 2550 | 2.7% |
| 26.36666667 | 2503 | 2.6% |
| Other values (99) | 62314 |
| Value | Count | Frequency (%) |
| 1.18 | 170 | 0.2% |
| 4.9 | 272 | |
| 10.38 | 544 | |
| 11.2 | 464 | |
| 12.5 | 137 | 0.1% |
| 14.6 | 582 | |
| 14.7 | 326 | |
| 15.5 | 162 | 0.2% |
| 15.61818182 | 8 | < 0.1% |
| 15.852 | 246 |
| Value | Count | Frequency (%) |
| 35.34666667 | 730 | 0.8% |
| 34.92333333 | 1165 | 1.2% |
| 34.73 | 576 | 0.6% |
| 34.66666667 | 1677 | 1.8% |
| 34.01 | 4871 | |
| 33.76333333 | 106 | 0.1% |
| 33.58333333 | 2717 | |
| 33.37333333 | 2550 | |
| 30.61666667 | 1300 | 1.4% |
| 30.43 | 3485 |
Area_in_hectares
Real number (ℝ)
High correlation
| Distinct | 25977 |
|---|---|
| Distinct (%) | 27.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16772.773 |
| Minimum | 0.58 |
|---|---|
| Maximum | 726300 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0.58 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 140 |
| median | 1087 |
| Q3 | 8500 |
| 95-th percentile | 100626.9 |
| Maximum | 726300 |
| Range | 726299.42 |
| Interquartile range (IQR) | 8360 |
Descriptive statistics
| Standard deviation | 43856.481 |
|---|---|
| Coefficient of variation (CV) | 2.6147424 |
| Kurtosis | 31.42573 |
| Mean | 16772.773 |
| Median Absolute Deviation (MAD) | 1065 |
| Skewness | 4.7182201 |
| Sum | 1.6067981 × 109 |
| Variance | 1.9233909 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 568 | 0.6% |
| 2 | 562 | 0.6% |
| 3 | 528 | 0.6% |
| 4 | 521 | 0.5% |
| 1 | 495 | 0.5% |
| 10 | 492 | 0.5% |
| 6 | 465 | 0.5% |
| 7 | 424 | 0.4% |
| 8 | 416 | 0.4% |
| 15 | 405 | 0.4% |
| Other values (25967) | 90922 |
| Value | Count | Frequency (%) |
| 0.58 | 1 | < 0.1% |
| 1 | 495 | |
| 1.5 | 1 | < 0.1% |
| 1.62 | 2 | < 0.1% |
| 2 | 562 | |
| 2.08 | 1 | < 0.1% |
| 2.5 | 2 | < 0.1% |
| 2.57 | 1 | < 0.1% |
| 2.78 | 1 | < 0.1% |
| 2.9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 726300 | 1 | |
| 712900 | 1 | |
| 711300 | 1 | |
| 699900 | 1 | |
| 687500 | 1 | |
| 686900 | 1 | |
| 672100 | 1 | |
| 657600 | 1 | |
| 641200 | 1 | |
| 636700 | 1 |
Production_in_tons
Real number (ℝ)
High correlation
| Distinct | 32495 |
|---|---|
| Distinct (%) | 33.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37057.867 |
| Minimum | 0.01 |
|---|---|
| Maximum | 2589591 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 179 |
| median | 1575 |
| Q3 | 14601.5 |
| 95-th percentile | 216134 |
| Maximum | 2589591 |
| Range | 2589591 |
| Interquartile range (IQR) | 14422.5 |
Descriptive statistics
| Standard deviation | 116817.9 |
|---|---|
| Coefficient of variation (CV) | 3.1523104 |
| Kurtosis | 70.734397 |
| Mean | 37057.867 |
| Median Absolute Deviation (MAD) | 1555 |
| Skewness | 6.8213819 |
| Sum | 3.5500696 × 109 |
| Variance | 1.3646422 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 546 | 0.6% |
| 1 | 538 | 0.6% |
| 3 | 491 | 0.5% |
| 10 | 455 | 0.5% |
| 4 | 438 | 0.5% |
| 5 | 422 | 0.4% |
| 6 | 417 | 0.4% |
| 100 | 383 | 0.4% |
| 8 | 378 | 0.4% |
| 7 | 363 | 0.4% |
| Other values (32485) | 91367 |
| Value | Count | Frequency (%) |
| 0.01 | 5 | < 0.1% |
| 0.1 | 33 | |
| 0.2 | 16 | |
| 0.3 | 15 | |
| 0.31 | 1 | < 0.1% |
| 0.38 | 1 | < 0.1% |
| 0.4 | 18 | |
| 0.5 | 20 | |
| 0.51 | 1 | < 0.1% |
| 0.55 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2589591 | 1 | |
| 2465212 | 1 | |
| 2410963 | 1 | |
| 2390840 | 1 | |
| 2356389 | 1 | |
| 2350043 | 1 | |
| 2343257 | 1 | |
| 2337693 | 1 | |
| 2070497 | 1 | |
| 2047918 | 1 |
Yield_ton_per_hec
Real number (ℝ)
High correlation
| Distinct | 71051 |
|---|---|
| Distinct (%) | 74.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6028547 |
| Minimum | 0.00051413882 |
|---|---|
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0.00051413882 |
|---|---|
| 5-th percentile | 0.20450372 |
| Q1 | 0.59816525 |
| median | 1.3278995 |
| Q3 | 2.8978943 |
| 95-th percentile | 15.451731 |
| Maximum | 93 |
| Range | 92.999486 |
| Interquartile range (IQR) | 2.2997291 |
Descriptive statistics
| Standard deviation | 6.7294769 |
|---|---|
| Coefficient of variation (CV) | 1.867818 |
| Kurtosis | 27.743528 |
| Mean | 3.6028547 |
| Median Absolute Deviation (MAD) | 0.87386048 |
| Skewness | 4.4412684 |
| Sum | 345146.28 |
| Variance | 45.285859 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 982 | 1.0% |
| 0.5 | 695 | 0.7% |
| 2 | 479 | 0.5% |
| 0.3333333333 | 349 | 0.4% |
| 1.5 | 297 | 0.3% |
| 0.6666666667 | 289 | 0.3% |
| 0.6 | 250 | 0.3% |
| 3 | 247 | 0.3% |
| 0.4 | 242 | 0.3% |
| 0.25 | 225 | 0.2% |
| Other values (71041) | 91743 |
| Value | Count | Frequency (%) |
| 0.0005141388175 | 1 | |
| 0.0008132169149 | 1 | |
| 0.00117319255 | 1 | |
| 0.001227747084 | 1 | |
| 0.001277732605 | 1 | |
| 0.001282051282 | 1 | |
| 0.001377410468 | 1 | |
| 0.001684919966 | 1 | |
| 0.00243902439 | 1 | |
| 0.003188697295 | 1 |
| Value | Count | Frequency (%) |
| 93 | 1 | |
| 91 | 2 | |
| 90.83333333 | 1 | |
| 90.82608696 | 1 | |
| 90.82352941 | 1 | |
| 90.81578947 | 1 | |
| 90.81428571 | 1 | |
| 90.81343284 | 1 | |
| 90.8125 | 1 | |
| 90.80748663 | 1 |
Interactions
Correlations
| Area_in_hectares | Crop | Crop_Type | K | N | P | Production_in_tons | State_Name | Unnamed: 0 | Yield_ton_per_hec | pH | rainfall | temperature | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Area_in_hectares | 1.000 | 0.142 | 0.105 | -0.131 | 0.069 | -0.091 | 0.898 | 0.093 | -0.037 | -0.005 | 0.050 | -0.139 | -0.045 |
| Crop | 0.142 | 1.000 | 0.633 | 1.000 | 1.000 | 1.000 | 0.124 | 0.152 | 0.087 | 0.333 | 0.689 | 0.307 | 0.243 |
| Crop_Type | 0.105 | 0.633 | 1.000 | 0.436 | 0.405 | 0.399 | 0.064 | 0.277 | 0.067 | 0.206 | 0.328 | 0.588 | 0.451 |
| K | -0.131 | 1.000 | 0.436 | 1.000 | 0.527 | 0.214 | 0.120 | 0.150 | 0.005 | 0.555 | -0.109 | 0.258 | -0.053 |
| N | 0.069 | 1.000 | 0.405 | 0.527 | 1.000 | 0.261 | 0.336 | 0.157 | 0.007 | 0.631 | -0.156 | 0.111 | 0.030 |
| P | -0.091 | 1.000 | 0.399 | 0.214 | 0.261 | 1.000 | 0.063 | 0.157 | 0.004 | 0.268 | -0.245 | 0.139 | -0.031 |
| Production_in_tons | 0.898 | 0.124 | 0.064 | 0.120 | 0.336 | 0.063 | 1.000 | 0.125 | -0.017 | 0.408 | -0.007 | -0.088 | -0.062 |
| State_Name | 0.093 | 0.152 | 0.277 | 0.150 | 0.157 | 0.157 | 0.125 | 1.000 | 0.122 | 0.114 | 0.091 | 0.636 | 0.560 |
| Unnamed: 0 | -0.037 | 0.087 | 0.067 | 0.005 | 0.007 | 0.004 | -0.017 | 0.122 | 1.000 | 0.045 | -0.000 | -0.044 | -0.033 |
| Yield_ton_per_hec | -0.005 | 0.333 | 0.206 | 0.555 | 0.631 | 0.268 | 0.408 | 0.114 | 0.045 | 1.000 | -0.139 | 0.058 | -0.060 |
| pH | 0.050 | 0.689 | 0.328 | -0.109 | -0.156 | -0.245 | -0.007 | 0.091 | -0.000 | -0.139 | 1.000 | -0.016 | 0.030 |
| rainfall | -0.139 | 0.307 | 0.588 | 0.258 | 0.111 | 0.139 | -0.088 | 0.636 | -0.044 | 0.058 | -0.016 | 1.000 | 0.156 |
| temperature | -0.045 | 0.243 | 0.451 | -0.053 | 0.030 | -0.031 | -0.062 | 0.560 | -0.033 | -0.060 | 0.030 | 0.156 | 1.000 |
Missing values
Sample
| Unnamed: 0 | State_Name | Crop_Type | Crop | N | P | K | pH | rainfall | temperature | Area_in_hectares | Production_in_tons | Yield_ton_per_hec | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | andhra pradesh | kharif | cotton | 120 | 40 | 20 | 5.46 | 654.34 | 29.266667 | 7300.0 | 9400.0 | 1.287671 |
| 1 | 1 | andhra pradesh | kharif | horsegram | 20 | 60 | 20 | 6.18 | 654.34 | 29.266667 | 3300.0 | 1000.0 | 0.303030 |
| 2 | 2 | andhra pradesh | kharif | jowar | 80 | 40 | 40 | 5.42 | 654.34 | 29.266667 | 10100.0 | 10200.0 | 1.009901 |
| 3 | 3 | andhra pradesh | kharif | maize | 80 | 40 | 20 | 5.62 | 654.34 | 29.266667 | 2800.0 | 4900.0 | 1.750000 |
| 4 | 4 | andhra pradesh | kharif | moong | 20 | 40 | 20 | 5.68 | 654.34 | 29.266667 | 1300.0 | 500.0 | 0.384615 |
| 5 | 5 | andhra pradesh | kharif | ragi | 50 | 40 | 20 | 5.64 | 654.34 | 29.266667 | 6700.0 | 11800.0 | 1.761194 |
| 6 | 6 | andhra pradesh | kharif | rice | 80 | 40 | 40 | 5.54 | 654.34 | 29.266667 | 35600.0 | 75400.0 | 2.117978 |
| 7 | 7 | andhra pradesh | kharif | sunflower | 50 | 60 | 30 | 5.36 | 654.34 | 29.266667 | 35900.0 | 11100.0 | 0.309192 |
| 8 | 8 | andhra pradesh | rabi | horsegram | 20 | 60 | 20 | 6.00 | 288.30 | 25.460000 | 600.0 | 200.0 | 0.333333 |
| 9 | 9 | andhra pradesh | rabi | jowar | 80 | 40 | 40 | 5.50 | 288.30 | 25.460000 | 18800.0 | 9400.0 | 0.500000 |
| Unnamed: 0 | State_Name | Crop_Type | Crop | N | P | K | pH | rainfall | temperature | Area_in_hectares | Production_in_tons | Yield_ton_per_hec | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99839 | 99839 | west bengal | kharif | moong | 20 | 40 | 20 | 5.50 | 1166.94 | 28.333333 | 293.0 | 136.0 | 0.464164 |
| 99840 | 99840 | west bengal | kharif | sunflower | 50 | 60 | 30 | 5.62 | 1166.94 | 28.333333 | 37.0 | 40.0 | 1.081081 |
| 99841 | 99841 | west bengal | rabi | moong | 20 | 40 | 20 | 5.62 | 152.54 | 22.280000 | 52.0 | 42.0 | 0.807692 |
| 99842 | 99842 | west bengal | rabi | potato | 180 | 60 | 90 | 4.84 | 152.54 | 22.280000 | 977.0 | 15920.0 | 16.294780 |
| 99843 | 99843 | west bengal | rabi | rapeseed | 50 | 40 | 20 | 5.12 | 152.54 | 22.280000 | 886.0 | 542.0 | 0.611738 |
| 99844 | 99844 | west bengal | rabi | wheat | 60 | 30 | 30 | 6.70 | 152.54 | 22.280000 | 2013.0 | 5152.0 | 2.559364 |
| 99845 | 99845 | west bengal | summer | maize | 80 | 40 | 20 | 5.68 | 182.50 | 29.200000 | 258.0 | 391.0 | 1.515504 |
| 99846 | 99846 | west bengal | summer | rice | 80 | 40 | 40 | 5.64 | 182.50 | 29.200000 | 105.0 | 281.0 | 2.676190 |
| 99847 | 99847 | west bengal | rabi | rice | 80 | 40 | 40 | 5.42 | 152.54 | 22.280000 | 152676.0 | 261435.0 | 1.712352 |
| 99848 | 99848 | west bengal | rabi | sesamum | 30 | 15 | 30 | 6.54 | 152.54 | 22.280000 | 244.0 | 95.0 | 0.389344 |